Cyberbullying Detection: Hybrid Models Based on Machine Learning and Natural Language Processing Techniques

نویسندگان

چکیده

The rise in web and social media interactions has resulted the efortless proliferation of offensive language hate speech. Such online harassment, insults, attacks are commonly termed cyberbullying. sheer volume user-generated content made it challenging to identify such illicit content. Machine learning wide applications text classification, researchers shifting towards using deep neural networks detecting cyberbullying due several advantages they have over traditional machine algorithms. This paper proposes a novel network framework with parameter optimization an algorithmic comparative study eleven classification methods: four seven shallow on two real world datasets. In addition, this also examines effect feature extraction word-embedding-techniques-based natural processing performance. Key observations from show that bidirectional attention models provide high results. Logistic Regression was observed be best among classifiers used. Term Frequency-Inverse Document Frequency (TF-IDF) demonstrates consistently accuracies techniques. Global Vectors (GloVe) perform better models. Bi-GRU Bi-LSTM worked amongst extensive experiments performed datasets establish importance work by comparing methods Our proposed outperform existing state-of-the-art approaches for detection, accuracy F1-scores as ~95% ~98%, respectively.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Machine Learning in Natural Language Processing

This thesis examines the use of machine learning techniques in various tasks of natural language processing, mainly for the task of information extraction from texts. The objectives are the improvement of adaptability of information extraction systems to new thematic domains (or even languages), and the improvement of their performance using as fewer resources (either linguistic or human) as po...

متن کامل

Machine Learning in Natural Language Processing

Machine learning is concerned with acquiring knowledge from an environment in a computational manner, in order to improve the performance. (See Langley [15] for overview of ML.) Many forces over the past couple of decades have seen the blending of ML and NLP become increasingly common. The ever expanding availability of large corpora; more powerful computing resources; and a greater demand for ...

متن کامل

Machine Learning in Natural Language Processing

2 Tagging 5 2.1 PoS tagging . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 2.1.1 Features for PoS Tagging . . . . . . . . . . . . . . . . 6 2.1.2 PoS Tagging Results for Different Languages . . . . . 7 2.2 Named Entity Recognition . . . . . . . . . . . . . . . . . . . . 8 2.2.1 One-step vs Two-step Classification . . . . . . . . . . 8 2.2.2 Features for NER . . . . . . . . . . . . . ...

متن کامل

Development of Machine Learning based Natural Language Processing System

For disease diagnostic knowledge base system (including Q&A, consistency checker, and informativity checker) requiring higher degree of strictness on the results of a query to the system, at the development stage, great efforts are necessary to improve machine learning based statistical performance tests, measured by precision and recall rates, on the training error and prediction. Performance ...

متن کامل

Natural Language Processing based Soft Computing Techniques

This paper presents the implementation of soft computing (SC) techniques in the field of natural language processing. An attempt is made to design and implement an automatic tagger that extract a free text and then tag it. The part of speech taggers (POS) is the process of categorization words based on their meaning, functions and types (noun, verb, adjective, etc). Two stages tagging system ba...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Electronics

سال: 2021

ISSN: ['2079-9292']

DOI: https://doi.org/10.3390/electronics10222810